Joint extraction and prediction of fujisaki's intonation model parameters
نویسندگان
چکیده
This paper presents a joint extraction and prediction framework for intonation modeling applied to Fujisaki’s intonation model for text-to-speech conversion. Previous methods in the area extract the parameters of accent and phrase commands for each sentence. Then, these parameters are related to linguistic features for prediction. In our approach commands that share the same linguistic features are globally estimated. This approach intends to overcome some consistency problems of the extracted model parameters. The global nature of the parameter optimization avoids the interpolation step, which sometimes can produce a bias in the extracted parameters. Experimental results show that the higher consistency of the parameters result in a higher accuracy when the fundamental frequency contours are predicted.
منابع مشابه
Intonation modeling for TTS using a joint extraction and prediction approach
This paper presents a joint extraction and prediction framework for intonation modeling. The intonation model is based on a superpositional approach using Bézier curves. The components are attached to minor phrase and accent group. A greedy algorithm performs succesive partitions on training data using linguistic information. The parameters related to each partition are obtained using a global ...
متن کاملEstimation of the parameters of the quantitative intonation model with continuous wavelet analysis
Intonation generation in state-of-the-art speech synthesis requires the analysis of a large amount of data. Therefore reliable algorithms for the extraction of the parameters of an intonation model from a given F0 contour are required. This contribution proposes improvements concerning the extraction of the parameters of the quantitative intonation model developed by Fujisaki. The improvements ...
متن کاملNew rule-based and data-driven strategy to incorporate Fujisaki's F 0 model to a text-to-speech system in Castillian Spanish
We will present the analysis of a Spanish prosody database by estimating the parameters of Fujisaki's model for FO contours. These parameters are classified attending to linguistic features and they form the analysis database. When synthesizing FO contours we extract the linguistic features from the text and perform a k-Nearest Neighbour search. Linguistic feature comparison distance is trained...
متن کاملPrediction of intonation patterns of accented words in a corpus of read Swedish news
This paper describes an initial attempt at the construction of a data-driven model of Swedish intonation. The study is mainly concerned with model building and prediction of the intonation patterns of accented words in a corpus of read news in Swedish. Extraction of pitch information is achieved by performing a stylization of the pitch contours. The information is used to build a model for the ...
متن کاملPrediction of intonation patterns of accented words in a corpus of read Swedish news through pitch contour stylization
This paper describes an initial attempt at the construction of a data-driven model of Swedish intonation. The study is mainly concerned with model-building and prediction of the intonation patterns of accented words in a corpus of read news in Swedish. Extraction of pitch information is achieved by performing a stylization of the pitch contours. The information is used to build a model for the ...
متن کامل